Building Topic Specific Language Mo Competitive Mo
نویسندگان
چکیده
The ability to build topic specific language models, rapidly and with minimal human effort, is a critical need for fast deployment and portability of ASR across different domains. The World Wide Web (WWW) promises to be an excellent textual data resource for creating topic specific language models. In this paper we describe an iterative web crawling approach which uses a competitive set of adaptive models comprised of a generic topic independent background language model, a noise model representing spurious text encountered in web based data (Webdata), and a topic specific model to generate query strings using a relative entropy based approach for WWW search engines and to weight the downloaded Webdata appropriately for building topic specific language models. We demonstrate how this system can be used to rapidly build language models for a specific domain given just an initial set of example utterances and how it can address the various issues attached with Webdata. In our experiments we were able to achieve a 20% reduction in perplexity for our target medical domain. The gains in perplexity translated to a 4% improvement in ASR word error rate (absolute) corresponding to a relative gain of 14%.
منابع مشابه
Ultralow-density nanocage-based metal-oxide polymorphs.
For two important metal oxides (MO, M=Mg, Zn) we predict, via accurate electronic structure calculations, that new low-density nanoporous crystalline phases may be accessible via the coalescence of nanocluster building blocks. Specifically, we consider the assembly of cagelike (MO)_{12} clusters exhibiting particularly high gas phase stability, leading to new polymorphs with energetic stabiliti...
متن کاملCompetitive Supramolecular Encapsulation of Amphiphilic Hyper- branched Polymers Made from A2 and BB’2 Type Monomers. 1. Polyaddi- tion of 1-(2-aminoethyl)piperazine to Divinyl Sulfone
The competitive host-guest encapsulations of palmityl chloride-modified amphiphilic hyperbranched poly(amido-amine) (HPAMAM-PC) and poly(sulfone-amine) (HPSA-PC) to selected pairs of dyes are reported. Watersoluble and chloroform-insoluble dyes, such as methyl orange (MO), methyl blue (MB), rose bengal (RB), fluorescein sodium (FSS), eosin Y (EY) and phloxine B (PB), can be transferred from aqu...
متن کاملBuilding topic specific language models from webdata using competitive models
The ability to build topic specific language models, rapidly and with minimal human effort, is a critical need for fast deployment and portability of ASR across different domains. The World Wide Web (WWW) promises to be an excellent textual data resource for creating topic specific language models. In this paper we describe an iterative web crawling approach which uses a competitive set of adap...
متن کاملCompetitive Adsorption of Molybdenum in the Presence of Phosphorus or Sulfur on Gibbsite
Anion adsorption on the aluminum oxide, gibbsite, was investigated as a function of solution pH (3Y11) and equilibrium solution Mo (3.13, 31.3, or 313 Kmol/L), P (96.9 Kmol/L), or S (156 Kmol/L) concentration. Adsorption of all three anions decreased with increasing pH. Electrophoretic mobility measurements indicated a downward shift in point of zero charge, indicative of an inner-sphere adsorp...
متن کاملDelineation of enriched zones of Mo, Cu and Re by concentration-volume fractal model in Nowchun Mo-Cu porphyry deposit, SE Iran
The purpose of this study is to identify the enriched zones of Cu, Mo and Re in Nowchun Mo-Cu porphyry deposit (SE Iran) based on subsurface data and using of concentration–volume (C–V) fractal model. The C-V model illustrates four and five geochemical zones based on Mo and Cu distributions respectively and there are three geochemical populations for Re. The main mineralization for Mo, Cu and R...
متن کامل